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Doss  (J.  Multiv.  Anal.  9  (1979),  460-464)  considered  distributions 
on  (0,1 ,2, . . . }m ,  whose  probability  generating  functions  were  of  the  form 


C(Sl,...,sm)  =  fa„  *  I  aisi  *  J.  ‘ijVj  S«}'Y 

where  y  >  0.  Whenever  G  is  a  probability  generating  function,  except  in 
degenerate  cases,  the  corresponding  distribution  is  an  m-variate 
distribution  with  negative  binomial  distributions  for  its  univariate 
marginals.  Apart  from  a  few  comments  in  the  bivariate  case,  Doss  gave  no 
discussion  of  either  necessary  or  sufficient  conditions  on  the  a’s  for  G 
to  be  a  probability  generating  function.  This  paper  discusses  the  known 
sufficient  conditions  and  obtains  various  necessary  conditions.  It  is 
shown  that  the  region  of  convergence  of  the  associated  power  series 
expansion  about  the  origin  plays  a  key  role.  Attention  is  concentrated 
on  the  bivariate  case:  even  here  it  seems  difficult  to  obtain  conditions 
which  are  both  necessary  and  sufficient. 
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\  1 .  INTRODUCTION 

'V 

The  negative  binomial  distribution  arises  in  probability  as  the 
distribution  of  the  waiting  time  to  achieve  a  specified  number  of 
successes  in  a  sequence  of  Bernoulli  trials.  In  addition,  it  has  been 
widely  used  in  statistics  as  a  model  for  a  variety  of  data  involving 
counts.  (Johnson  and  Kotz  ((1969),  Chapter  5)  and  Douglas  (1979)  give 
detailed  discussion  of  this  distribution  and  its  properties. 

7?Multi variate  analogues  of  the  negative  binomial  distribution  are  of 
interest  as  joint  distributions  of  waiting  times  in  Bernoulli  trials  and 
for  modelling  data  involving  pairs  or  m-tuples  of  possibly  dependent 
counts . 

c*  1  m 

.• f  r  This  paper  considers  m-variate  distributions  concentrated  on  2Z+  = 

(ZZ+)m  where  2Z+  =  {0,1,2,...}  and  having  negative  binomial  univariate 

marginal  distributions.  Such  distributions  will  be  called  multivariate/ 

m-variate  negative  binomial  distributions .  This  terminology  is  consistent 

with,  for  example,  that  of  Johnson  and  Kotz  (1969)  and  Lancaster  (1969), 

where  multivariate  x  distributions  have,  by  definition,  univariate 

marginals  of  type  x.  Not  all  authors  have,  however,  adopted  this  usage. 

For  example,  Doss  (1979)  employs  a  restricted  definition  which  calls  a 

distribution  m-variate  negative  binomial  iff  it  has  a  joint  probability 

generating  function  (p.g.  fn.)  of  the  form 

U(sr...,sm)  =  {H(s1 . sm)}’Y  >  (1) 

where  y  >  ()  is  not  necessarily  an  integer,  and  H  is  of  the  form 
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If  (1)  is  the  p.g.  fn.  of  an  m-variate  distribution  which  is 
non -degenerate  (in  the  sense  that  it  is  not  concentrated  on  some  subspace 
of  ]Rm  having  dimension  strictly  less  than  m) ,  then  all  of  its  univariate 
marginal  distributions  are  negative  binomial.  In  the  case  m  =  2, 
distributions  of  this  general  form  had  been  studied  previously  by  Wiid 
(1958)  and  Edwards  and  Gurland  (1961) .  (See  also  the  discussion  towards 
the  end  of  §2  and  early  in  §4  of  the  present  paper.) 

It  is  not  hard  to  see  that  distributions  of  this  form  do  not,  for 
any  fixed  m,  exhaust  the  class  of  all  m-variate  negative  binomial 
distributions.  Several  types  of  counterexample  can  be  constructed.  One 
type  can  be  obtained  by  appropriate  shifts  of  probability  mass  starting 
from  a  distribution  with  p.g.  fn.  of  the  form  (1).  Consider  the  case  m  = 
2.  It  is  enough  to  start  from  a  p.g.  fn.  of  the  form 

G(s1,s2)  =  g1(s1)G2(s2)  ,  (3) 


where  G^(s)  =  (a.+b^s)  \  i  =  1,2,  are  (univariate)  negative  binomial 

t 

p.g.  fns.  with  the  same  index  y  >  0.  Then  for  e  suitably  small,  one  can 
always  shift  mass  e  away  from  each  of  the  points  (0,0), (1,1)  and  place  a 
further  mass  of  c  at  each  of  (1,0),  (0,1).  Such  a  shift  of  mass  clearly 
preserves  the  marginal  distributions  but  not  the  independence  which  is 
implicit  in  (3).  Assuming  e  >  0,  it  is  easy  to  see  that  the  new 


distribution  exhibits  negative  covariance;  alternatively  allowing  e  <  0 
would  introduce  positive  covariance.  The  p.g.  fn.  of  this  distribution  is 
easily  written  down  and  is  clearly  not  of  the  form  (1)  with  m  =  2. 

Another  type  of  counterexample  can  be  constructed  by  mixing.  Let 
Gj,G^  be  any  two  distinct  bivariate  p.g.  fns.  of  the  form  (1)  with 
corresponding  marginals  identical.  Then 

G(sps2)  =  !j[G1(s1,s2)  +  G2(s1,s2)] 

is  a  bivariate  p.g.  fn.  whose  marginal  p.g.  fns.  are  exactly  those  of  G^ 
and  G?  but  whose  form  cannot  be  given  by  (1). 

A  further  type  of  counterexample  can  be  provided  within  the  family 
of  distributions  that  variously  bears  the  names  of  Lyraud,  Farlie, 

Gumbel,  and  Morgenstem:  see,  for  example,  Cambanis  (1977). 

Nevertheless,  as  evidenced  at  least  by  the  results  of  Theorem  1  and 
Theorem  2  of  Doss's  paper,  multivariate  negative  binomial  distributions 
with  p.g.  fns.  of  the  form  (1)  do  have  a  special  interest.  Doss's  main 
result  (Theorem  1)  can  be  expressed  in  our  terminology  as  follows: 

PROPOSITION'.  Within  the  class  of  all  m-variate  negative  binomial 
distributions ,  the  subclass  consisting  of  all  those  which  are  m-variate 
linear  exponential  distributions  coincides  with  the  subclass  of  those 
whose  p.g.  fns.  are  of  the  form  (1). 

The  statistical  importance  of  this  characterization  of  distributions 
whose  p.g.  fns.  are  of  the  form  (1)  flows  from  the  importance  in 
statistical  inference  of  linear  exponential  distributions;  families  of 
such  distributions  are  the  mathematically  and  statistically  "nice" 
families  of  distributions:  see,  for  example,  Johansen  (1979). 
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In  spite  of  the  characterization  expressed  by  the  above  proposition, 
there  is  apparently  no  known  characterization  of  the  class  of  a's  for 
which  (1),  with  H  given  by  (2),  is  a  p.g.  fn.  In  the  next  section  we 
discuss  known  sufficient  conditions  through  a  relationship  with 
infinitely  divisible  multivariate  Poisson  distributions.  Section  3 
attacks  the  problem  of  obtaining  necessary  conditions:  the  approach 
involves  consideration  of  the  convergence  behaviour  of  power  series  in 
several  variables  and  necessary  conditions  for  such  series  to  have 
non-negative  coefficients.  Such  convergence  questions  do  not  seem  to 
have  received  attention  in  the  probability  literature.  The  basic 
technique  for  obtaining  necessary  conditions  for  the  non-negativity  of 
the  coefficients  was  used  first  in  [19]  and  subsequently  in  [25]  and 
f 26 1 .  We  establish  by  this  means  several  sets  of  necessary  conditions, 
one  of  which,  (9),  may  well  be  sufficient.  Details  are  presented  mainly 
in  the  bivariate  case.  The  final  section  mentions  a  possible 
characterization  of  p.g.  fns.  of  the  form  (1)  by  infinite  divisibility: 
an  analogue  of  the  characterization  for  multivariate  Poisson 
distributions  that  is  quoted  in  Section  2.  In  addition,  it  discusses  a 
special  subclass  of  p.g.  fns.  of  the  form  (1)  that  has  been  somewhat  more 
extensively  studied  in  the  literature. 

2.  SUFFICIENT  CONDITIONS 

In  order  for  a  function  G  of  the  form  (1),  with  H  given  by  (2),  to 
be  the  p.g.  fn.  of  a  distribution  on  2Z™,  it  is  sufficient  that  a^  be 
positive  and  that  all  the  other  a's  be  non-positive.  If  degeneracy  is  to 
be  avoided,  some  further  restrictions  are  needed:  we  will  comment  later 
on  these  in  the  bivariate  case.  The  sufficiency  is  readily  seen  by 
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considering  an  appropriate  mixture  of  a  particular  type  of  multivariate 
Poisson  distribution. 

It  is  known  (cf.  Patil  and  Joshi  (1968),  p.  82;  Johnson  and  Kotz 
(1969),  p.  300)  that 


exp(H(s1,... ,  sm)}  , 


(4) 


where  H  is  given  by  (2)  with  a^  >  0  and  all  the  a's  other  than  a^ 

non-negative,  is  the  p.g.  fn.  of  a  multivariate  distribution  with  Poisson 

marginals.  In  fact  a^,  which  is  determined  by  the  condition  that  the 

probabilities  must  sum  to  one,  is  given  by  -a^  -  the  sum  of  all  the  other 

a's.  The  general  form  of  such  distributions  was  apparently  first  noted 

by  Lo£ve  ((1950),  p.  84)  and  independently  by  Krishnamoorthy  (1951),  the 

bivariate  case  having  been  considered  earlier  by  Campbell  (1934) .  Dwass 

and  Teicher  (1957)  have  shown  that  the  class  of  all  distributions  with 

p.g.  fns.  of  the  form  (4)  is  exactly  the  class  of  all  infinitely  divisible 

m-variate  distributions  with  Poisson  marginals. 

Suppose  now  that  for  each  multi -index  k,  a^  =  Acx^  where  A  and  otj. 

for  k  *  0  are  non-negative.  Denote  the  resultant  p.g.  fn. ,  obtained  from 

(4),  by  K  (s.  , . . . , s) .  Then  for  any  probability  distribution  F  on  [0,°°), 
a  i  in 

K  defined  by 


K(Sj, .  •  •  ,sm)  /g5  (s^j  . . . ,  s^) F (dA) 

is  another  m-variate  p.g.fn.  IfFis  the  gamma  distribution  with  moment 
generating  function  M(t)  =  (1-t)  Y  where  y  >  0,  then  taking  account  of 
the  condition  aQ  =  -£  a^  ,  one  finds  that  K  is  of  the  form  (1)  with  H 
given  by  (2),  except  that  now  a^.  =  -a^  for  each  multi-index  k  *  0  and 
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ag  =  1  -  otg.  Thus  K  is  of  the  form  (1)  with  ag  >  0  and  -  0  for 
k  *  0.  This  proves  the  assertion  made  at  the  beginning  of  this  section 
since  any  such  K  can  be  obtained  by  the  process  just  described. 

In  the  bivariate  case  this  construction  appears  to  have  been  used  in 
their  studies  of  accident  proneness,  first  by  Arbous  and  Kerrich  (1951) 
in  the  case  of  independent  Poissons,  and  then  by  Edwards  and  Gurland 
(1961)  in  the  case  of  an  arbitrary  infinitely  divisible  bivariate  Poisson 
distribution.  The  m-variate  case  starting  from  independent  Poissons  was 
considered  in  a  similar  context  by  Bates  and  Neyman  (1952) . 

One  might  suspect  that  some  other  mixing  process  could  produce  a 
wider  class  of  p.g.  fns.  of  the  form  (1).  It  is  not  hard  to  check  that  a 
mixing  process  similar  to  the  above  but  with  a^  =  k  *  0  and  the 

A^'s  iid  F  would  not  result  in  a  p.g.  fn.  of  the  form  (1)  starting  from 
the  general  infinitely  divisible  m-variate  Poisson  form  (4).  However,  if 
attention  is  restricted  to  the  case  of  independent  Poissons  and  the 
mixture  then  performed  according  to  the  modified  scheme,  the  resultant 
p.g.  fn.  is  a  product  of  univariate  negative  binomial  p.g.  fns.  This  does 
result  in  a  wider  class  of  p.g.  fns.  since,  as  we  shall  see  in  the  next 
section,  some  of  the  a^'s  must  now  be  positive. 


3.  NECESSARY  CONDITIONS 

In  this  section,  so  as  to  simplify  the  discussion,  we  will  initially 
restrict  attention  to  the  bivariate  case  of  (1),  i.e.  to  the  form 


G(sPs2)  ~  *a0  +  alSl  +  a2S2  +  a12SlS2 ^ 


where  y  >  0.  The  previously  observed  sufficient  condition  for  (1)  to  be 
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a  p.g.  fn.  reduces  to 

aQ  >  0  and  a^a^a^  <  0  .  (S) 

In  fact,  if  (lb)  is  to  be  a  non -degenerate  bivariate  p.g.  fn.,  we  must 
have,  in  addition,  at  least  one  of  the  inequalities  a^2*0  ,  a^^O 
satisfied,  and  then  each  marginal  will  be  a  negative  binomial 
distribution.  If  we  are  not  concerned  about  possible  degeneracy  but  only 
with  when  a  function  of  the  form  (lb)  is  a  p.g.  fn.,  albeit  perhaps 
degenerate,  then  such  additional  conditions  need  not  be  considered. 

That  the  conditions  (S)  are  not  also  necessary  was  observed  by  Doss 
(1979).  He  pointed  out  that  the  case  of  independence  is  not  covered  by 
(S) :  in  this  case  it  is  easy  to  check  (cf .  Case  II  later  in  this  section) 
that  necessarily,  except  in  degenerate  situations, 

a0’a12  >  0  al,a2  <  0  ‘ 

Having  made  this  point,  Doss  avoided  further  discussion  of  which  a's  were 
permissible  by  allowing  in  his  Definition  1  any  real  a's  consistent  with 
(lb),  or  (1)  in  the  general  case,  being  a  p.g.  fn.  For  his  purposes, 
this  was  adequate:  it  did  not  appear  to  affect  the  approach  to  his 
characterization  results.  However,  the  problem  of  trying  to  find  what 
a's  are  permissible  does  seem  worthy  of  further  consideration.  One  might 
even  regard  a  characterization  such  as  Doss's  Theorem  1  as  relatively 
powerless  unless  this  problem  is  solved.  Nevertheless,  as  the  remainder 
of  this  section  will  indicate,  the  problem  of  actually  characterizing  the 
class  of  permissible  a's  appears  to  be  a  difficult  one. 


In  order  that  a  function  of  the  form  (1)  with  H  given  by  (2)  be  a 
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p.g.  fn. ,  it  is  necessary  that: 

(a)  it  possess  a  power  series  expansion  about  the  origin, 
(absolutely)  convergent  for  (s^,...,sm)  in  some  non-trivial  region  C; 

(b)  this  series  be  such  that  all  its  coefficients  are  non-negative; 
and  (c)  G(l, ... ,1)  =  1. 

Restricting  still  to  the  bivariate  case,  we  examine  first  the  convergence 
behaviour . 

In  general,  when  m  =  1  we  know  that  the  power  series  defining  any 
p.g.  fn.  will  have  a  non-trivial  radius  of  convergence  R  >  1.  For  some 
distributions  we  may  have  R  =  +°°,  in  which  case  the  series  represents  an 
entire  function.  For  the  univariate  negative  binomial  distribution  with 
p.g.  fn.  (a+bs)  \  where  a,y  >  0  and  a  +  b  =  1,  we  have  R  =  | a/b  |  <  +<*> . 

Mien  m  >  1,  an  even  when  m  =  2,  the  situation  is  usually  much  more 
complicated.  Corresponding  to  any  power  series  of  the  form 


m 


,neZ+ 


m  n 
c  s.s, 
mn  1  2 


(5) 


there  is  a  set,  F,  of  pairs  (r^r^)  of  so-called  associated  radii  of 
convergence  (cf.  Behnke  and  Thullen  (1970),  § 1 1 1 . 1 ;  Fuks  (1963),  p.  46), 
any  pair  being  such  that  the  series  converges  absolutely  for 
^ S1  I <  rl  ’  ls2^<r2  diverges  for  |s^|>r^  ,  |s7|>r7.  One  can  consider 
also  maximal  radii  of  convergence  inhere  these  are  defined  as  upper 
limits:  R^  =  sup{r^:  (r^,r^)  e  T},  i  =  1,2.  It  is  known  that  I’  is  a 
curve  which  can  be  described  by  either  of  the  equations  r^  =  <J>(r^)  or 
Tj  =  <i(r7),  say,  where  <t>  and  are  non-increasing  continuous  functions 
({3|,  p.  50;  1 1 2 ] ,  p.  46).  Thus  F  is  a  curve  of  rather  special  structure 
which  describes  the  boundary  of  the  region  C  of  convergence.  Further,  the 


image  of  C  under  the  log  map  (Sj  -*■  fr  ^  ,  s7  -»  i?n  s^)  is  a  convex  region 

cm,  p.  so). 

In  the  n.g.  fn.  case,  i.e.  when  c  >0  for  all  m,n  e  TL  and 
Umx  =  1,  it  is  natural  to  restrict  attention  at  least  to  s^  >  0  and 
s7  >  0.  Moreover,  for  a  p.g.  fn.  it  is  always  true  that  (1,1)  e  C,  or 
equivalently  that  C  contains  the  unit  square  [0,1]  x [0,1],  and  so  for 
many  purposes  one  can  further  restrict  to  this  latter  domain.  For  most 
applications  of  bivariate  p.g.  fns.,  this  domain  suffices:  indeed,  we 
have  been  unable  to  find  any  text  on  probability  theory  which  discusses 
questions  of  convergence  since  they  all  restrict  attention  simply  to  the 
unit  square.  However,  as  will  become  evident  soon,  this  does  not  suffice 
for  our  present  purposes:  the  associated  region  C  of  convergence,  or 
equivalently  the  curve  V ,  is  intimately  connected  with  the  problem  of 
deciding  when  a  convergent  power  series  (about  the  origin)  has 
non-negative  coefficients  i.e.  with  (b)  above. 

h’e  will  return  to  discuss  the  convergence  behaviour  of  the  series 
associated  with  (lb),  but  first,  partly  in  order  to  narrow  the  number  of 
special  cases  that  need  to  be  considered  when  discussing  convergence,  we 
investigate  the  non-negativity  of  the  coefficients  of  the  series. 

Ihe  paper  by  Kemp  and  Kemp  (1965)  considered  functions  G  of  the  form 

G(s)  =  expfa^s-l)  +  a2(s2-l)}  .  (6) 

An  obvious  sufficient  condition  for  this  to  be  a  p.g.  fn.  is  a^,a7  >  0, 
since  then  G  is  a  particular  compound  Poisson  distribution  (in  the  sense 
of  feller  (1968)  and  others).  By  means  of  an  elegant  but  quite 
elementary  technique,  Kemp  and  Kemp  showed  that  the  condition  a^a,  '  0 


is  also  necessary.  For  reasons  made  clear  in  their  paper,  they  called 
t he  d i st "ihut ion  corresponding  to  (6)  the  liermite  distribution.  A 
similar  technique  was  used  to  advantage  by  Milne  (1971)  and  Milne  and 
U'estcott  (1972)  to  characterize  certain  multivariate  p.g.  fns.  that  arose 
in  the  context  of  point  processes:  in  the  latter  paper,  the  distributions 
involved  were  certain  multivariate  analogues  of  the  Hermite  distribution; 
in  the  former,  it  was  shown  that  a  function  of  the  form  (4),  with  II  given 
by  (2),  ls  a  p.g.  fn.  iff  all  a's  other  than  a^  are  non-negative  (as 
usual,  a()  is  determined  by  the  normalization),  the  necessity  of  this 
condition  having  not  been  pointed  out  previously.  In  order  to  facilitate 
a  related  approach  and  an  appreciation  of  its  limitations  in  the  context 
of  multivariate  negative  binomial  distributions,  we  describe  the  basic 
technique  in  the  original  Kemp  and  Kemp  setting. 

Since  (0)  is  an  entire  function  for  all  aj,a7,  its  power  series 
expansion  about  s  =  0  must  converge  for  all  (real)  s,  ;uul  because  of  the 
exponential  fonn  of  G,  we  must  have  G(s)  >  0  whenever  s  >  0.  Hence  we 
can  consider  in  G(s).  If  the  coefficients  in  the  power  series  are  to  be 
all  non-negative,  then  both  G(s)  and  Hn  G(s)  must  be  non-decreasing 
functions  of  s  when  s  is  non-negative.  It  follows  that  d/ds  >n  Cts!  e, 
i . e .  that 

a^  +  2a^  s  >  0  i  " ) 

for  all  s  >  0.  hitting  s  =  0  shows  that  a^  must  be  non-negative,  while 
letting  s  >  yields  a,  >  0.  Thus  a.,a?  >  0  is  a  necessary  condition 
for  (6)  to  lie  a  p.g.  fn. 

Hie  critical  properties  needed  to  obtain  this  result  were: 
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t 


and  (ii)  that  one  could  let  s  -*■  ®  in  (7) . 

Property  (i)  was  a  consequence  of  the  (exponential)  form  of  G  while 
property  (ii)  flowed  from  the  fact  that  G  was  entire,  i.e.  an  analytic 
function  for  all  s.  In  the  cases  considered  by  the  other  authors 
mentioned  above,  the  technique  generalized  straightforwardly:  in 
particular,  the  functions  involved  were  still  entire  functions  albeit  of 
several  variables. 

Now  consider  G  given  by  (lb)  with  y  >  0  and  suppose  that  this  has  a 
power  series  expansion  about  (0,0)  with  non-negative  coefficients.  Since 
only  non-negative  powers  of  s^  and  s^  appear  we  must  have  a^  >  0,  and 
this  ensures  that  G(SpS,)  >  0  for  any  (SpS2)  in  the  region  C  of 
convergence  associated  with  this  series.  From  the  non-negativity  of  the 
coefficients  in  the  expansion,  we  deduce  that  G(SpS?)  is  a  non-decreasing 
function  of  each  of  its  arguments,  and  hence  that  Cn  G(SpS2)  is 
similarly  non -decreasing  in  each  of  its  arguments  when  these  are 
non-negative.  Thus  we  must  have 

~  Sn  C.(SpS2)  >  0  ,  i  =  1,2  , 

'  i 

or  equivalently, 


a2  +  312S 1  "  0 

for  all  (SpS^J  e  C.  Setting  s^  =  0  =  s^  yields  a^  <  0 
In  addition,  it  follows  that 


(8) 


and  a^  s  0. 


I 
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a.j  2  -  min 


~al  a2 
r2  ’  rl 


for  all  (rpr,)  e  F.  Thus  the  conditions 


a0  >  0 


a^<  0  , a2^ 0 


a^2  s  inf  min 


(r1,r2)er 


~al  "a2 
r2  ’  rl 


(9a) 

(9b) 

(9c) 


are  necessary  conditions  for  a  function  of  the  form  (lb)  to  have  a  power 
series  expansion  about  (0,0)  with  all  its  coefficients  non-negative.  If 
(lb)  is  in  fact  a  p.g.  fn. ,  then  we  can  add  the  further  condition 
0(1,1)  =  1,  which  implies  that 

VVV  ai2  ■  1 

and  hence  that  a^  >  1 . 

In  general,  it  seems  to  be  difficult  to  reduce  the  condition  on  a^7 
to  something  simple.  A  notable  exception  arises  if  at  least  one  of  a^ 
and  a 2  is  zero.  Then  (9c)  reduces  to  5  0.  When  both  a^  and  a7 
are  negative,  it  is  possible  for  a^  t0  be  positive:  this  happens,  for 
example,  as  we  have  already  observed,  in  the  independence  case.  Since 
(1,1)  t  C  whenever  G  is  a  p.g.  fn.,  we  can  assert  that,  irrespective  of 
the  particular  structure  of  F,  we  must  have 

a12  <  minf-aj, -a2)  .  (10) 


In  the  light  of  the  form  of  C  in  special  cases  that  will  be  discussed 


shortly,  it  is  clear  that  (10)  is  a  strictly  weaker  condition  than  (9c). 


Another  variant  of  the  condition  (9c)  on  a^  can  be  obtained  as  follows. 
Set  Rj  =  supfrjt  (r^l)  e  D,  =  supir.,:  (l,r?)  e  r).  These  are,  in 
fact,  the  radii  of  convergence  of  the  respective  marginal  p.g.  fns.  Using 
these  and  (9c)  leads  to 


a 


12 


s  min 


(ID 


which  will,  in  general,  be  strictly  weaker  than  (9c)  but  stronger  than 

(10). 

It  may  well  be  that  the  conditions  (9)  are  sufficient  as  well  as 
necessary,  but  so  far  we  have  been  unable  to  prove  or  disprove  this. 


We  now  classify  the  relevant  convergence  behaviour  associated  with 
(lb).  Clearly,  for  Sj,s2  2  0  the  power  series  expansion  of  (lb)  about 
the  origin  will  not  converge  beyond  F,  the  closest  branch  to  the  origin 
and  in  the  positive  quadrant  of  the  curve  given  by 


a12sls2  +  a2s2  +  al5l  + 


ao  ■  " 


(12) 


Ihus  the  region  of  convergence  C  will  be  the  region  between  the 
coordinate  axes  and  the  curve  F.  Recall  that  if  G  is  a  p.g.  fn.,  then  by 
(9),  a()  0  and  a^,a2  s  0. 


CASF.  I:  a]2  =  0 

Here  the  curve  F  is  simply  the  straight  line 
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which  cuts  the  respective  axes  at  -ap/aj  and  -a()/n-,.  // 

f  rom  now  on,  we  can  and  will  assume  that  a^  *  0.  Hence  1 12)  can  be 
rewritten  as 


aT 


S1S2 


s2  + 


12 


S1 


12 


or  equi valent lv  as 


a2 

L  +  M 

S1  +  a12 

2  a12 

^ala2  "  a0a 12^ 


“T 

a12 


MSI 


lor  our  purposes,  it  is  enough  to  consider  cases  where  the  RMS  of  il.V)  is 
non  negative,  or  equivalently  a^a^/a^  £  a\2‘  Ior  suPPose  the  contrary. 
'Ihen  substituting  in  the  first  inequality  of  (8)  would  yield 


ala2 

al  +  S2  '  0 


for  all  s ^  >  0,  or  equivalently  that  S£  <  -a^/a.,  tor  all  s7  >  0.  Since 
this  is  a  contradiction,  we  must  have  a^a^/a^  s  a^7 . 


CASH  II:  a  |  =  a^a7/a^ 

Here  r  reduces  to  a  pair  of  straight  lines  given  by  the  equations 


a  a 

s,  +  =0  ,  s_  +  - —  =  0, 

1  a12  12 


and  G  becomes 
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Cms1,s2I  =  (a()fl  ♦  aj/a0  Sj  )(1 +  a2/a()  s7j  . 

Clearly  this  is  the  "independence"  case. 

I  rom  now  on,  we  assume  in  addition  to  Up  *  0  that  ap  ^  aja7/a(). 

CASH  III:  a  =  0  =  a. 

Recall  that  we  are  then  forced  bv  (10)  to  have  aj ,  <  0.  Ihen  "  is 
the  upper  branch  of  the  curve 

'a0 

sis>  =  • 

1  .  a12 

which  is  a  rectangular  hyperbola  whose  asymptotes  are  the  coordinate 
axes.  In  this  case,  G  becomes 


Gisj.s,)  =  (an*a12slS2) 


-Y 


This  reflects  a  situation  of  complete  dependence:  for  the  correspond ing 
random  variables  X,Y  we  have  X  =  Y  with  probability  one,  and  each  has  a 
negative  binomial  distribution  with  p.g.  fn.  (a^apS)"1  .  Notice  that 
here  =  +<>•  =  R2 ;  in  the  previous  two  cases,  both  these  quantities  were 
finite. 

CASH  IV  (  i  1  :  a^  =  <>  ,  a  ,  <  0 

Again  we  must  have  =  0.  Here  T  is  the  upper  branch  of 


S2  +  a 


12 


-ar 


*12 


vo..— 

*  _ _  _ 
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which  is  a  rectangular  hyperbola  with  asymptotes  s^  =  0  and  s^  =  -aj/aj7, 
the  latter  quantity  being  negative.  This  hyperbola  cuts  the  s^-axis  at 
-a()/aj.  Thus  =  -a()/a1>  but  =  +« .  // 

(IASI i  IV  ( i  i) :  a^  0  ,  a7  =  0 

lliis  follows  by  symmetry  from  the  previous  case.  // 

G\Sli  V:  a1-0,a2<0 

Here  again  the  curve  r  is  one  of  the  branches  of  a  rectangular 
hyperbola,  the  general  equation  being  given  by  (13)  and  the  asymptotes  by 


s,  =  — —  ,  s. 
1  a12  ‘ 


12 


Two  subcases  need  to  be  distinguished: 

(i)  aj  ,  <•  0.  In  this  case,  the  asymptotes  meet  in  the  negative 
quadrant,  and  the  relevant  branch  of  the  hyperbola  is  the  upper  one. 

(ii.)  ('  Up  <  a^/a^.  Here  the  asymptotes  meet  in  the  positive 
quadrant.  The  lower  branch  of  the  hyperbola  is  clearly  the  relevant  one. 

In  each  of  these  subcases,  the  intercepts  with  the  respective 
coordinate  axes,  and  hence  the  respective  values  of  Rj  and  R?,  are  given 
by  -a()/a j  and  -a^/a^.  // 


It  is  instructive  to  draw  the  region,  C,  of  convergence  for  each  of 
the  cases  and  to  verify  directly  that  (1,1)  e  C.  With  this,  we  have 
completed  our  discussion  of  the  convergence  behaviour  associated  with 
(lb)  in  all  cases  where  this  is  a  p.g.  fn.  It  is  easy  to  see  that  the 
independence  case,  Case  II,  can  be  viewed  as  a  limit  of  Case  Y  (ii)  as 
a^  ■*  aia2^a0'  Similarly,  Case  I  may  be  considered  as  a  limit  of 
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Case  V  ( i )  as  a^ -*■  0  or  of  Case  V  (ii)  as 

Observe  that  Case  V  (ii)  is  the  only  one  in  which  condition  (9c) 
will  come  into  play  as  a  necessary  condition  for  the  power  series 
expansion  of  (lb)  about  (0,0)  to  have  non-negative  coefficients.  Since 
in  this  case  the  curve  F  is  bounded  and  strictly  concave,  it  seems 
reasonable  to  expect  that  the  infinitum  in  (9c)  will  be  achieved  for 
exactly  one  pair  C r j ,  r^ )  e  r  for  each  fixed  (a^a^)  with  a^  <  0  and 
a;  <  0.  It  seems  difficult,  however,  to  simplify  (9c)  algebraically,  even 
in  this  case. 


lor  m  •  2,  the  situation  will  be  much  more  intricate.  'Ihe  analogue 


of  (8)  will  be  provided  by  the  conditions 


)  a,  I!  s.  <  0  ,  i  =  1  ,2,. . .  ,  m  ,  (14) 

k  ~  ir  k\ ( i )  J 


where  the  sum  is  over  all  distinct  multi -indices  k  c  {l,...,m}  such  that 
H!  ^  k,  the  empty  product  being  interpreted  (as  usual)  as  one,  and  the 
conditions  must  be  satisfied  by  all  (Sj,...,s  )  c  C,  the  convergence  set 
which  is  defined  by  analogy  with  the  bivariate  case. 

When  m  3,  the  conditions  (14)  become 


which  must 
()  yields 


al  +  a12s2  +  a13s3  +  a123s2s3  *  ° 


a2  +  a12sl  +  a23S3  +  a123sls3  *  0 


a3  +  a 1 3s 1  +  a23s2  +  a123sls2  "  ° 


bi'  satisfied  for  all  (Sj,s,,s^)  c  C.  lotting  s j  =  s,  : 
ara2’a3  *  (l*  s3  =  ()  in  (15a)  ;ind  (15b)  yields 


( 1 5a ) 
(15b) 
( 1 5c  1 


a j  +  a j  ,s ,  <  0  and 


a12Sl 


which  must  he  satisfied  for  all  such  that  (Sj,s2,0)  e  C.  There 

will  be  four  more  conditions  similar  to  this  last  pair  and  obtained  by 
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setting  s^  =  0  in  (15a)  and  (15c)  and  s.  =  0  in  (15b)  and  (15c).  But 
these  nine  conditions  deduced  from  (15)  do  not  exhaust  all  the 
restrictions  contained  in  (15). 

A  set  of  conditions  which  may  be  useful  even  though  they  are,  in 
general,  strictly  weaker  than  (15)  can  be  obtained  by  using  the  fact  that 
(1,1,1)  must  belong  to  C.  We  then  deduce 

Uj.a^.a,  <  0  (16a) 

a ( j  <  mini  a., -a.)  ,  i,i  =  1,2,3  ,  i  <  j  (16b) 

a  i  t  3  minf-(aj+a12  +  ai;5),  -(a2  +  a12+a2^),  -(a3+a13+a23)}  ,  (16c) 

which  correspond  to  condition  (10)  in  the  bivariate  case.  Clearly  from 

(lob)  we  must  have  a..  =0  as  soon  as  at  least  one  of  a.  or  a.  is 

‘J  i  .1 

r.ero.  Since  then,  for  example,  a^  =  0  forces  a^2  =  0  and  a13  =  0,  we 
must  by  (16c)  have,  in  addition,  =  0. 

The  convergence  behaviour  for  m  >  2  will  also  be  complicated. 
However,  just  as  for  m  =  2  the  boundary  curve  !'  was  given  by  part  of  a 
hyperbola  or  a  limiting  case  thereof,  the  basic  equation  being  (12),  so 
when  m  2  the  boundary  curve  must  be  given  by  part  of  a  hyperboloidal 
surface  or  a  limiting  case  thereof,  where  the  basic  equation  is 

"<sl’-’Sm)  =  0  • 

lo  catalogue  the  behaviour  would  be  messy  and  not  particularly 
enl ightening. 
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4.  CONCLUDING  REMARKS 

Although  the  discussion  so  far  has  been  devoted  to  obtaining 
necessary  and/or  sufficient  conditions  for  (1)  to  be  a  p.g.  fn.,  it  does 
seem  appropriate  to  make  some  remarks  about  one  special  subclass  of 
p.g.  fns.  of  the  form  (1).  Tins  is  the  subclass  consisting  of  those  C  of 
the  form 


m 


G(Sl,...,sm)  -  (a „  ♦  I  ajSj) 


-Y 


(17) 


where  y  >  0.  From  52  and  §3,  it  is  clear  that  such  a  function  is  a 
p.g.  fn.  iff  aQ  >  0,  ap.. .  ,am  s  0,  and  aQ  +. . .+  =  1. 

Rates  and  Neyman  (1952)  were  the  first  to  give  a  systematic  study  of 
distributions  with  p.g.  fns.  of  the  form  (17),  although  A.  Guldberg  (1934) 
and  Arbous  and  Kerrich  (1951)  had  considered  aspects  of  the  bivariate 
case  and  S.  Guldberg  (1935)  had  noted  the  general  form  together  with  some 
properties.  Lukacs  (1973)  has  given  a  characterization  of  the  m- variate 
case  with  y  =  1,  an  m-variate  geometric  distribution,  through  a 
regression  property. 

Amongst  the  properties  considered  by  Bates  and  Neyman  was  the 
following,  a  generalization  of  which  serves,  in  fact,  to  characterize 
p.g.  fns.  of  the  form  (17)  amongst  those  of  the  form  (1):  that  for  the 
corresponding  random  variables  Xp...,Xm,  the  joint  distributions  of 
XpX^,...^^  and  X  =  X^+^  +...+  Xm  are  of  the  same  form  for  any  k  such 
that  1  <  k  <  m.  This  property  of  closure  under  addition  is  important  and 
serves  to  explain  the  appearance  of  distributions  with  p.g.  fns.  of  the 
form  (17),  but  not  those  of  the  more  general  form  (1),  as  the 
finite-dimensional  distributions  of  point  processes  whose  one -dimensional 


distributions  are  negative  binomial:  see  Gregoire  (1980)  and  Diggle  and 
Milne  (1980). 

Sibuya  et  al_.  (1964)  have  suggested  that  since  distributions  with 
p.g.  fns.  of  the  form  (17)  bear  a  relation  to  negative  binomial 
distributions  similar  to  that  between  multinomial  and  binomial 
distributions,  they  should  be  referred  to  as  negative  multinomial 
distributions.  Such  distributions  were  the  only  multivariate  negative 
binomial  distributions  to  be  discussed  in  Johnson  and  Kotz  (1969),  which 
gives  a  good  summary'  of  the  various  known  properties. 

Let  us  mention  one  other  characterization  that  may  be  possible  for 
p.g.  fns.  of  the  form  (1).  Recall  the  Dwass  and  Teicher  (1957) 
characterization,  through  infinite  divisibility,  of  p.g.  fns.  of  the  form 
(4).  This  was  quoted  earlier  in  §2.  It  is  clear  that  whenever  a  function 
of  the  form  (1)  with  H  given  by  (2)  is  a  p.g.  fn.,  it  is  infinitely 
divisible:  using  an  obvious  notation,  we  have 


G  (s-, , 

Y  1’ 


,sm^  ^y/n^sl’' 


s  )] 

m  y  J 


m 


for  each  n  =  1,2, _  We  might  conjecture  that  p.g.  fns.  of  the  form  (1) 

are  the  only  possible  infinitely  divisible  p.g.  fns.  with  negative 
binomial  marginals.  At  the  present  time,  we  have  been  unable  to  prove  or 
disprove  this  conjecture. 

Finally,  we  note  that  Hunter  (1969),  from  a  situation  with  two 
queues  in  parallel,  derived  a  functional  equation  ([16],  p.  437,  equation 
(5.7))  for  the  p.g.  fn.  of  a  bivariate  distribution,  which  he  showed  to 
have  geometric  marginal  distributions.  His  attempts  to  solve  the 
equation  failed,  and  moreover,  he  showed  that  it  was,  in  general, 
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impossible  to  choose  the  constants  a^,  a^,  and  a ^  so  that  a  p.g.  fn. 
of  the  form  (lb)  would  satisfy  the  equation. 

Omission:  I  am  aware  of  an  unpublished  manuscript  by  F.  Downton 
dealing  with  bivariate  exponential  and  geometric  distributions.  There 
may  be  in  this  work  some  material  relevant  to  the  present  discussions, 
but  unfortunately  I  am  unable  to  consult  it  at  the  moment. 
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